Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 4385 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 2 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 342.6 KiB |
| Average record size in memory | 80.0 B |
Variable types
| Numeric | 9 |
|---|
| Dataset has 2 (< 0.1%) duplicate rows | Duplicates |
FREQUENCIA BOMBA 2 is highly overall correlated with VAZÃO DE RECALQUE - FT03 and 1 other fields | High correlation |
NIVEL DO RESERVATÓRIO - LT01 is highly overall correlated with VAZÃO DE RECALQUE - FT03 and 1 other fields | High correlation |
VAZÃO DE RECALQUE - FT03 is highly overall correlated with FREQUENCIA BOMBA 1 and 5 other fields | High correlation |
PRESSÃO DE SUCÇÃO - PT01 is highly overall correlated with NIVEL DO RESERVATÓRIO - LT01 and 3 other fields | High correlation |
PRESSÃO DE RECALQUE - PT02 is highly overall correlated with FREQUENCIA BOMBA 1 and 4 other fields | High correlation |
FREQUENCIA BOMBA 1 is highly overall correlated with VAZÃO DE GRAVIDADE - FT02 and 2 other fields | High correlation |
VAZÃO DE GRAVIDADE - FT02 is highly overall correlated with FREQUENCIA BOMBA 1 and 3 other fields | High correlation |
FREQUENCIA BOMBA 1 has 392 (8.9%) zeros | Zeros |
FREQUENCIA BOMBA 2 has 1136 (25.9%) zeros | Zeros |
FREQUENCIA BOMBA 3 has 3687 (84.1%) zeros | Zeros |
VAZÃO DE ENTRADA- FT01 has 795 (18.1%) zeros | Zeros |
VAZÃO DE GRAVIDADE - FT02 has 272 (6.2%) zeros | Zeros |
PRESSÃO DE RECALQUE - PT02 has 79 (1.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-28 22:28:26.435437 |
|---|---|
| Analysis finished | 2022-11-28 22:28:46.848723 |
| Duration | 20.41 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
| Distinct | 1196 |
|---|---|
| Distinct (%) | 27.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.939888 |
| Minimum | 0 |
|---|---|
| Maximum | 59.988281 |
| Zeros | 392 |
| Zeros (%) | 8.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 57.842842 |
| median | 57.988792 |
| Q3 | 57.988792 |
| 95-th percentile | 58.317091 |
| Maximum | 59.988281 |
| Range | 59.988281 |
| Interquartile range (IQR) | 0.14595032 |
Descriptive statistics
| Standard deviation | 17.142205 |
|---|---|
| Coefficient of variation (CV) | 0.33003932 |
| Kurtosis | 5.1385932 |
| Mean | 51.939888 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.6473655 |
| Sum | 227756.41 |
| Variance | 293.8552 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 57.98879242 | 2632 | |
| 0 | 392 | 8.9% |
| 59.98828125 | 45 | 1.0% |
| 58.01076508 | 13 | 0.3% |
| 49.99084473 | 13 | 0.3% |
| 58.05471039 | 12 | 0.3% |
| 44.99212646 | 11 | 0.3% |
| 29.99230957 | 10 | 0.2% |
| 58.12062836 | 9 | 0.2% |
| 34.99102783 | 9 | 0.2% |
| Other values (1186) | 1239 |
| Value | Count | Frequency (%) |
| 0 | 392 | |
| 0.01275072433 | 1 | < 0.1% |
| 0.01913495362 | 1 | < 0.1% |
| 0.02551918104 | 1 | < 0.1% |
| 0.03190341219 | 1 | < 0.1% |
| 0.04066173732 | 1 | < 0.1% |
| 0.0472676903 | 1 | < 0.1% |
| 0.06109762564 | 1 | < 0.1% |
| 0.07513397187 | 1 | < 0.1% |
| 0.08153351396 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59.98828125 | 45 | |
| 59.98095703 | 2 | < 0.1% |
| 59.9793396 | 1 | < 0.1% |
| 59.97729492 | 2 | < 0.1% |
| 59.97363281 | 1 | < 0.1% |
| 59.94200516 | 1 | < 0.1% |
| 59.94067383 | 1 | < 0.1% |
| 59.92602539 | 1 | < 0.1% |
| 59.92236328 | 2 | < 0.1% |
| 59.89031601 | 1 | < 0.1% |
| Distinct | 3190 |
|---|---|
| Distinct (%) | 72.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.81337 |
| Minimum | 0 |
|---|---|
| Maximum | 59.991943 |
| Zeros | 1136 |
| Zeros (%) | 25.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 34.908695 |
| Q3 | 38.02359 |
| 95-th percentile | 49.226396 |
| Maximum | 59.991943 |
| Range | 59.991943 |
| Interquartile range (IQR) | 38.02359 |
Descriptive statistics
| Standard deviation | 17.608565 |
|---|---|
| Coefficient of variation (CV) | 0.63309714 |
| Kurtosis | -0.92343398 |
| Mean | 27.81337 |
| Median Absolute Deviation (MAD) | 4.5594673 |
| Skewness | -0.71795914 |
| Sum | 121961.63 |
| Variance | 310.06157 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1136 | 25.9% |
| 29.99597168 | 12 | 0.3% |
| 39.99707031 | 10 | 0.2% |
| 57.99245453 | 8 | 0.2% |
| 59.99194336 | 7 | 0.2% |
| 44.99578857 | 6 | 0.1% |
| 34.99468994 | 6 | 0.1% |
| 24.99725342 | 3 | 0.1% |
| 36.11528015 | 3 | 0.1% |
| 49.99450684 | 3 | 0.1% |
| Other values (3180) | 3191 |
| Value | Count | Frequency (%) |
| 0 | 1136 | |
| 0.0004281122528 | 1 | < 0.1% |
| 0.0005553666269 | 1 | < 0.1% |
| 0.001183174783 | 1 | < 0.1% |
| 0.003486369271 | 1 | < 0.1% |
| 0.003600863041 | 1 | < 0.1% |
| 0.008166855201 | 1 | < 0.1% |
| 0.01569584385 | 1 | < 0.1% |
| 0.02166323923 | 1 | < 0.1% |
| 0.02376453392 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59.99194336 | 7 | |
| 59.98828125 | 3 | |
| 59.98324203 | 1 | < 0.1% |
| 59.98095703 | 1 | < 0.1% |
| 59.97058105 | 1 | < 0.1% |
| 59.96425629 | 1 | < 0.1% |
| 59.96066284 | 1 | < 0.1% |
| 59.95888519 | 1 | < 0.1% |
| 59.95774841 | 1 | < 0.1% |
| 59.95323944 | 1 | < 0.1% |
FREQUENCIA BOMBA 3
Real number (ℝ)
| Distinct | 638 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4058755 |
| Minimum | 0 |
|---|---|
| Maximum | 59.988281 |
| Zeros | 3687 |
| Zeros (%) | 84.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 50.017325 |
| Maximum | 59.988281 |
| Range | 59.988281 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 16.765124 |
|---|---|
| Coefficient of variation (CV) | 2.6171479 |
| Kurtosis | 3.2320295 |
| Mean | 6.4058755 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.2674761 |
| Sum | 28089.764 |
| Variance | 281.06937 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3687 | |
| 57.98879242 | 36 | 0.8% |
| 0.1318343282 | 10 | 0.2% |
| 59.98828125 | 7 | 0.2% |
| 39.9934082 | 5 | 0.1% |
| 54.98956299 | 4 | 0.1% |
| 29.99230957 | 3 | 0.1% |
| 0.1245101988 | 2 | < 0.1% |
| 34.99102783 | 2 | < 0.1% |
| 0.1272614598 | 1 | < 0.1% |
| Other values (628) | 628 | 14.3% |
| Value | Count | Frequency (%) |
| 0 | 3687 | |
| 0.0001107446224 | 1 | < 0.1% |
| 0.001534604002 | 1 | < 0.1% |
| 0.005035049282 | 1 | < 0.1% |
| 0.006258170586 | 1 | < 0.1% |
| 0.007479577791 | 1 | < 0.1% |
| 0.009604785591 | 1 | < 0.1% |
| 0.0099241063 | 1 | < 0.1% |
| 0.01236863434 | 1 | < 0.1% |
| 0.01295140106 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59.98828125 | 7 | |
| 59.96926498 | 1 | < 0.1% |
| 59.95925522 | 1 | < 0.1% |
| 59.94100571 | 1 | < 0.1% |
| 59.87623978 | 1 | < 0.1% |
| 59.86713791 | 1 | < 0.1% |
| 59.86031342 | 1 | < 0.1% |
| 59.85322952 | 1 | < 0.1% |
| 59.85174561 | 1 | < 0.1% |
| 59.83979797 | 1 | < 0.1% |
NIVEL DO RESERVATÓRIO - LT01
Real number (ℝ)
| Distinct | 4372 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2355002 |
| Minimum | 0.29407585 |
|---|---|
| Maximum | 4.4049139 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0.29407585 |
|---|---|
| 5-th percentile | 1.9517602 |
| Q1 | 2.7917631 |
| median | 3.30353 |
| Q3 | 3.7749107 |
| 95-th percentile | 4.2591256 |
| Maximum | 4.4049139 |
| Range | 4.1108381 |
| Interquartile range (IQR) | 0.98314762 |
Descriptive statistics
| Standard deviation | 0.69706843 |
|---|---|
| Coefficient of variation (CV) | 0.21544379 |
| Kurtosis | -0.032666725 |
| Mean | 3.2355002 |
| Median Absolute Deviation (MAD) | 0.4901371 |
| Skewness | -0.55811943 |
| Sum | 14187.668 |
| Variance | 0.48590439 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3 | 4 | 0.1% |
| 4.300000191 | 4 | 0.1% |
| 3.920138836 | 2 | < 0.1% |
| 4.047839642 | 2 | < 0.1% |
| 3.236786366 | 2 | < 0.1% |
| 4.295138836 | 2 | < 0.1% |
| 3.432870388 | 2 | < 0.1% |
| 3.836458206 | 2 | < 0.1% |
| 3.001000643 | 2 | < 0.1% |
| 3.768044233 | 1 | < 0.1% |
| Other values (4362) | 4362 |
| Value | Count | Frequency (%) |
| 0.2940758467 | 1 | |
| 0.3723406196 | 1 | |
| 0.4662296474 | 1 | |
| 0.4815826118 | 1 | |
| 0.8559085727 | 1 | |
| 0.8737350106 | 1 | |
| 0.8987794518 | 1 | |
| 0.9570472836 | 1 | |
| 0.9613910913 | 1 | |
| 0.9710686803 | 1 |
| Value | Count | Frequency (%) |
| 4.404913902 | 1 | |
| 4.403257847 | 1 | |
| 4.401571751 | 1 | |
| 4.401538372 | 1 | |
| 4.401222229 | 1 | |
| 4.400625229 | 1 | |
| 4.398981094 | 1 | |
| 4.398623466 | 1 | |
| 4.397883892 | 1 | |
| 4.39741993 | 1 |
VAZÃO DE ENTRADA- FT01
Real number (ℝ)
| Distinct | 1922 |
|---|---|
| Distinct (%) | 43.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 112.66294 |
| Minimum | 0 |
|---|---|
| Maximum | 383.87036 |
| Zeros | 795 |
| Zeros (%) | 18.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.11574074 |
| median | 0.11574074 |
| Q3 | 264.27155 |
| 95-th percentile | 280.08785 |
| Maximum | 383.87036 |
| Range | 383.87036 |
| Interquartile range (IQR) | 264.1558 |
Descriptive statistics
| Standard deviation | 132.60141 |
|---|---|
| Coefficient of variation (CV) | 1.1769746 |
| Kurtosis | -1.8552865 |
| Mean | 112.66294 |
| Median Absolute Deviation (MAD) | 0.11574074 |
| Skewness | 0.34275668 |
| Sum | 494026.98 |
| Variance | 17583.135 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.1157407388 | 1652 | |
| 0 | 795 | |
| 0.2314814776 | 4 | 0.1% |
| 1.273148179 | 3 | 0.1% |
| 276.5046387 | 2 | < 0.1% |
| 261.458313 | 2 | < 0.1% |
| 264.6990662 | 2 | < 0.1% |
| 276.2731628 | 2 | < 0.1% |
| 263.541687 | 2 | < 0.1% |
| 0.3472222388 | 2 | < 0.1% |
| Other values (1912) | 1919 |
| Value | Count | Frequency (%) |
| 0 | 795 | |
| 0.04144435376 | 1 | < 0.1% |
| 0.0578703694 | 1 | < 0.1% |
| 0.1157407388 | 1652 | |
| 0.1275791973 | 1 | < 0.1% |
| 0.1383209676 | 1 | < 0.1% |
| 0.1736586094 | 1 | < 0.1% |
| 0.176884532 | 1 | < 0.1% |
| 0.1776106358 | 1 | < 0.1% |
| 0.1835666746 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 383.8703613 | 1 | |
| 381.5904236 | 1 | |
| 376.8986206 | 1 | |
| 374.4212952 | 1 | |
| 370.3518372 | 1 | |
| 367.4073792 | 1 | |
| 366.7018738 | 1 | |
| 366.4682617 | 1 | |
| 365.7449341 | 1 | |
| 364.6219177 | 1 |
| Distinct | 4114 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 132.94359 |
| Minimum | 0 |
|---|---|
| Maximum | 326.1713 |
| Zeros | 272 |
| Zeros (%) | 6.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 123.97898 |
| median | 136.00012 |
| Q3 | 148.20116 |
| 95-th percentile | 179.92901 |
| Maximum | 326.1713 |
| Range | 326.1713 |
| Interquartile range (IQR) | 24.222176 |
Descriptive statistics
| Standard deviation | 44.78165 |
|---|---|
| Coefficient of variation (CV) | 0.336847 |
| Kurtosis | 4.9098637 |
| Mean | 132.94359 |
| Median Absolute Deviation (MAD) | 12.126892 |
| Skewness | -0.67329216 |
| Sum | 582957.65 |
| Variance | 2005.3962 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 272 | 6.2% |
| 108.8570862 | 1 | < 0.1% |
| 141.1200867 | 1 | < 0.1% |
| 140.4069519 | 1 | < 0.1% |
| 128.144989 | 1 | < 0.1% |
| 125.5947037 | 1 | < 0.1% |
| 153.2140656 | 1 | < 0.1% |
| 143.0213776 | 1 | < 0.1% |
| 144.0009308 | 1 | < 0.1% |
| 146.4695892 | 1 | < 0.1% |
| Other values (4104) | 4104 |
| Value | Count | Frequency (%) |
| 0 | 272 | |
| 27.51053429 | 1 | < 0.1% |
| 30.12467003 | 1 | < 0.1% |
| 30.16630745 | 1 | < 0.1% |
| 30.91310501 | 1 | < 0.1% |
| 31.05513191 | 1 | < 0.1% |
| 41.97084808 | 1 | < 0.1% |
| 56.65369415 | 1 | < 0.1% |
| 56.89728165 | 1 | < 0.1% |
| 57.67729187 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 326.1712952 | 1 | |
| 324.9286499 | 1 | |
| 322.9801636 | 1 | |
| 320.3776245 | 1 | |
| 304.5761719 | 1 | |
| 302.353302 | 1 | |
| 302.0870361 | 1 | |
| 301.3987427 | 1 | |
| 300.1445923 | 1 | |
| 299.913208 | 1 |
VAZÃO DE RECALQUE - FT03
Real number (ℝ)
| Distinct | 4114 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 112.40697 |
| Minimum | 0 |
|---|---|
| Maximum | 194.35185 |
| Zeros | 24 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.028935185 |
| Q1 | 111.65463 |
| median | 118.82233 |
| Q3 | 125.62949 |
| 95-th percentile | 136.53865 |
| Maximum | 194.35185 |
| Range | 194.35185 |
| Interquartile range (IQR) | 13.974861 |
Descriptive statistics
| Standard deviation | 31.328318 |
|---|---|
| Coefficient of variation (CV) | 0.27870442 |
| Kurtosis | 7.2635604 |
| Mean | 112.40697 |
| Median Absolute Deviation (MAD) | 6.9732361 |
| Skewness | -2.6536729 |
| Sum | 492904.54 |
| Variance | 981.46349 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.0289351847 | 226 | 5.2% |
| 0 | 24 | 0.5% |
| 117.8530121 | 3 | 0.1% |
| 119.1956024 | 2 | < 0.1% |
| 132.5810242 | 2 | < 0.1% |
| 113.6284714 | 2 | < 0.1% |
| 114.9884186 | 2 | < 0.1% |
| 110.1273193 | 2 | < 0.1% |
| 132.378479 | 2 | < 0.1% |
| 117.2742996 | 2 | < 0.1% |
| Other values (4104) | 4118 |
| Value | Count | Frequency (%) |
| 0 | 24 | 0.5% |
| 0.01446759235 | 1 | < 0.1% |
| 0.0289351847 | 226 | |
| 0.2875666618 | 1 | < 0.1% |
| 0.3858614862 | 1 | < 0.1% |
| 0.4217657745 | 1 | < 0.1% |
| 0.5559648871 | 1 | < 0.1% |
| 0.6147419214 | 1 | < 0.1% |
| 0.69016397 | 1 | < 0.1% |
| 0.8436223269 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 194.3518524 | 1 | |
| 189.3903809 | 1 | |
| 188.7152863 | 1 | |
| 185.4285889 | 1 | |
| 183.7471924 | 1 | |
| 183.2344971 | 1 | |
| 181.8721008 | 1 | |
| 180.8506927 | 1 | |
| 179.4629211 | 1 | |
| 177.5573273 | 1 |
PRESSÃO DE SUCÇÃO - PT01
Real number (ℝ)
| Distinct | 4364 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.1077789 |
| Minimum | 0.87751222 |
|---|---|
| Maximum | 5.6827645 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0.87751222 |
|---|---|
| 5-th percentile | 2.7513295 |
| Q1 | 3.6190748 |
| median | 4.147234 |
| Q3 | 4.6634259 |
| 95-th percentile | 5.2179035 |
| Maximum | 5.6827645 |
| Range | 4.8052523 |
| Interquartile range (IQR) | 1.0443511 |
Descriptive statistics
| Standard deviation | 0.76275458 |
|---|---|
| Coefficient of variation (CV) | 0.1856854 |
| Kurtosis | 0.24480127 |
| Mean | 4.1077789 |
| Median Absolute Deviation (MAD) | 0.52290487 |
| Skewness | -0.41059959 |
| Sum | 18012.61 |
| Variance | 0.58179454 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5.541088104 | 7 | 0.2% |
| 5.532407284 | 6 | 0.1% |
| 5.636573792 | 4 | 0.1% |
| 5.55555582 | 3 | 0.1% |
| 5.53819418 | 2 | < 0.1% |
| 4.202215195 | 2 | < 0.1% |
| 4.284318924 | 2 | < 0.1% |
| 5.182291985 | 2 | < 0.1% |
| 4.845046043 | 2 | < 0.1% |
| 5.092852592 | 1 | < 0.1% |
| Other values (4354) | 4354 |
| Value | Count | Frequency (%) |
| 0.8775122166 | 1 | |
| 0.8825973868 | 1 | |
| 0.8876825571 | 1 | |
| 0.8906169534 | 1 | |
| 0.892767787 | 1 | |
| 0.8949127793 | 1 | |
| 0.8992086649 | 1 | |
| 0.9035044909 | 1 | |
| 1.519142866 | 1 | |
| 1.598580122 | 1 |
| Value | Count | Frequency (%) |
| 5.68276453 | 1 | |
| 5.668966293 | 1 | |
| 5.668751717 | 1 | |
| 5.666208267 | 1 | |
| 5.665445328 | 1 | |
| 5.663450718 | 1 | |
| 5.661117077 | 1 | |
| 5.660072803 | 1 | |
| 5.656788349 | 1 | |
| 5.652602673 | 1 |
| Distinct | 1992 |
|---|---|
| Distinct (%) | 45.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.794268 |
| Minimum | 0 |
|---|---|
| Maximum | 28.084936 |
| Zeros | 79 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.082886364 |
| Q1 | 21.721619 |
| median | 22.048611 |
| Q3 | 23.017941 |
| 95-th percentile | 25.954861 |
| Maximum | 28.084936 |
| Range | 28.084936 |
| Interquartile range (IQR) | 1.2963219 |
Descriptive statistics
| Standard deviation | 6.1425533 |
|---|---|
| Coefficient of variation (CV) | 0.29539647 |
| Kurtosis | 6.1429142 |
| Mean | 20.794268 |
| Median Absolute Deviation (MAD) | 0.96932983 |
| Skewness | -2.6683162 |
| Sum | 91182.864 |
| Variance | 37.730961 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 22.00520706 | 216 | 4.9% |
| 23.00347328 | 183 | 4.2% |
| 22.9745369 | 155 | 3.5% |
| 23.01794052 | 145 | 3.3% |
| 22.01967621 | 145 | 3.3% |
| 21.97627258 | 140 | 3.2% |
| 22.98900414 | 103 | 2.3% |
| 22.04861069 | 102 | 2.3% |
| 23.046875 | 87 | 2.0% |
| 0 | 79 | 1.8% |
| Other values (1982) | 3030 |
| Value | Count | Frequency (%) |
| 0 | 79 | |
| 0.002456213813 | 1 | < 0.1% |
| 0.01011194568 | 1 | < 0.1% |
| 0.01062385458 | 1 | < 0.1% |
| 0.01230416168 | 1 | < 0.1% |
| 0.01609536819 | 1 | < 0.1% |
| 0.01681616344 | 1 | < 0.1% |
| 0.0184647534 | 1 | < 0.1% |
| 0.01964788325 | 1 | < 0.1% |
| 0.02034074813 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 28.08493614 | 1 | < 0.1% |
| 28.05792236 | 1 | < 0.1% |
| 28.04745102 | 1 | < 0.1% |
| 28.04636955 | 1 | < 0.1% |
| 28.03819466 | 2 | < 0.1% |
| 28.02372551 | 5 | |
| 28.01240349 | 1 | < 0.1% |
| 28.00926018 | 5 | |
| 28.00657845 | 1 | < 0.1% |
| 28.00614929 | 1 | < 0.1% |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| FREQUENCIA BOMBA 1 | FREQUENCIA BOMBA 2 | FREQUENCIA BOMBA 3 | NIVEL DO RESERVATÓRIO - LT01 | VAZÃO DE ENTRADA- FT01 | VAZÃO DE GRAVIDADE - FT02 | VAZÃO DE RECALQUE - FT03 | PRESSÃO DE SUCÇÃO - PT01 | PRESSÃO DE RECALQUE - PT02 | |
|---|---|---|---|---|---|---|---|---|---|
| Timestamp | |||||||||
| 2018-01-01 18:00:00 | 49.404 | 0.000 | 0.000 | 3.747 | 0.000 | 108.857 | 87.449 | 4.809 | 16.963 |
| 2018-01-01 19:00:00 | 52.155 | 0.000 | 0.000 | 3.797 | 280.593 | 110.514 | 94.778 | 4.873 | 18.047 |
| 2018-01-01 20:00:00 | 51.284 | 0.000 | 0.000 | 3.918 | 279.349 | 109.423 | 91.448 | 5.002 | 17.925 |
| 2018-01-01 21:00:00 | 50.210 | 0.000 | 0.000 | 4.035 | 276.239 | 116.948 | 91.837 | 5.116 | 17.016 |
| 2018-01-02 18:00:00 | 56.769 | 0.000 | 0.000 | 4.083 | 279.826 | 128.389 | 105.958 | 5.077 | 20.014 |
| 2018-01-02 19:00:00 | 57.272 | 0.000 | 0.000 | 4.108 | 0.000 | 125.107 | 105.298 | 5.079 | 20.961 |
| 2018-01-02 20:00:00 | 56.636 | 0.000 | 0.000 | 3.730 | 0.000 | 127.512 | 103.857 | 4.704 | 20.048 |
| 2018-01-02 21:00:00 | 57.463 | 0.000 | 0.000 | 3.352 | 0.000 | 126.797 | 106.353 | 4.311 | 20.047 |
| 2018-01-03 18:00:00 | 57.936 | 0.000 | 0.000 | 4.263 | 269.974 | 131.095 | 110.580 | 5.218 | 20.043 |
| 2018-01-03 19:00:00 | 59.162 | 0.000 | 0.000 | 4.254 | 0.000 | 129.145 | 112.751 | 5.176 | 21.073 |
| FREQUENCIA BOMBA 1 | FREQUENCIA BOMBA 2 | FREQUENCIA BOMBA 3 | NIVEL DO RESERVATÓRIO - LT01 | VAZÃO DE ENTRADA- FT01 | VAZÃO DE GRAVIDADE - FT02 | VAZÃO DE RECALQUE - FT03 | PRESSÃO DE SUCÇÃO - PT01 | PRESSÃO DE RECALQUE - PT02 | |
|---|---|---|---|---|---|---|---|---|---|
| Timestamp | |||||||||
| 2020-12-29 20:00:00 | 57.708 | 45.828 | 0.000 | 3.810 | 0.116 | 140.620 | 115.867 | 4.689 | 22.989 |
| 2020-12-29 21:00:00 | 57.684 | 45.027 | 0.000 | 3.403 | 0.116 | 132.692 | 108.486 | 4.342 | 22.020 |
| 2020-12-30 18:00:00 | 57.989 | 46.421 | 0.000 | 3.668 | 0.116 | 149.632 | 118.490 | 4.525 | 23.018 |
| 2020-12-30 19:00:00 | 57.989 | 47.173 | 0.000 | 3.235 | 0.116 | 149.119 | 120.389 | 4.072 | 23.018 |
| 2020-12-30 20:00:00 | 57.989 | 47.556 | 0.000 | 2.806 | 0.116 | 140.063 | 117.885 | 3.660 | 23.018 |
| 2020-12-30 21:00:00 | 57.989 | 46.198 | 0.000 | 2.487 | 274.195 | 132.376 | 108.883 | 3.383 | 22.066 |
| 2020-12-31 18:00:00 | 57.989 | 45.858 | 0.000 | 4.122 | 0.116 | 154.669 | 118.099 | 4.976 | 23.047 |
| 2020-12-31 19:00:00 | 57.989 | 46.798 | 0.000 | 3.670 | 0.116 | 149.437 | 121.815 | 4.499 | 23.047 |
| 2020-12-31 20:00:00 | 57.989 | 47.061 | 0.000 | 3.221 | 0.116 | 151.018 | 117.933 | 4.071 | 23.047 |
| 2020-12-31 21:00:00 | 0.000 | 45.870 | 57.989 | 2.803 | 0.116 | 129.294 | 106.693 | 3.714 | 21.976 |
Most frequently occurring
| FREQUENCIA BOMBA 1 | FREQUENCIA BOMBA 2 | FREQUENCIA BOMBA 3 | NIVEL DO RESERVATÓRIO - LT01 | VAZÃO DE ENTRADA- FT01 | VAZÃO DE GRAVIDADE - FT02 | VAZÃO DE RECALQUE - FT03 | PRESSÃO DE SUCÇÃO - PT01 | PRESSÃO DE RECALQUE - PT02 | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.000 | 0.000 | 0.000 | 3.920 | 0.116 | 0.000 | 0.029 | 5.182 | 0.000 | 2 |
| 1 | 0.000 | 0.000 | 0.000 | 4.295 | 0.116 | 0.000 | 0.029 | 5.532 | 0.000 | 2 |